Overview

Dataset statistics

Number of variables: 23
Number of observations: 2928
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 863.9 KiB
Average record size in memory: 302.1 B

Variable types

NUM: 21
CAT: 2

Reproduction

Analysis started: 2021-08-10 10:19:19.075569
Analysis finished: 2021-08-10 10:20:41.153233
Duration: 1 minute and 22.08 seconds
Version: pandas-profiling v2.7.1
Command line: pandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configuration: config.yaml

Warnings

Country has a high cardinality: 183 distinct values (High cardinality)
under-five deaths is highly correlated with infant deaths (High correlation)
infant deaths is highly correlated with under-five deaths (High correlation)
GDP is highly correlated with percentage expenditure (High correlation)
percentage expenditure is highly correlated with GDP (High correlation)
thinness 5-9 years is highly correlated with thinness 10-19 years (High correlation)
thinness 10-19 years is highly correlated with thinness 5-9 years (High correlation)
df_index is uniformly distributed (Uniform)
Country is uniformly distributed (Uniform)
df_index has unique values (Unique)
infant deaths has 838 (28.6%) zeros (Zeros)
percentage expenditure has 606 (20.7%) zeros (Zeros)
Measles has 973 (33.2%) zeros (Zeros)
under-five deaths has 775 (26.5%) zeros (Zeros)
Income composition of resources has 130 (4.4%) zeros (Zeros)

Variables

df_index
Real number (ℝ≥0)

UNIFORM
UNIQUE
Distinct count: 2928
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1467.5273224043715
Minimum: 0
Maximum: 2937
Zeros: 1
Zeros (%): < 0.1%
Memory size: 23.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 146.35
Q1: 732.75
Median: 1465.5
Q3: 2203.25
95-th percentile: 2790.65
Maximum: 2937
Range: 2937
Interquartile range (IQR): 1470.5

Descriptive statistics

Standard deviation: 848.8254023
Coefficient of variation (CV): 0.5784051781
Kurtosis: -1.201062795
Mean: 1467.527322
Median Absolute Deviation (MAD): 735.5
Skewness: 0.002318914912
Sum: 4296920
Variance: 720504.5637

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
2047 | 1 | < 0.1%
1080 | 1 | < 0.1%
1076 | 1 | < 0.1%
1074 | 1 | < 0.1%
1072 | 1 | < 0.1%
1070 | 1 | < 0.1%
1068 | 1 | < 0.1%
1066 | 1 | < 0.1%
1064 | 1 | < 0.1%
1062 | 1 | < 0.1%
Other values (2918) | 2918 | 99.7%

Minimum 5 values

Value | Count | Frequency (%)
0 | 1 | < 0.1%
1 | 1 | < 0.1%
2 | 1 | < 0.1%
3 | 1 | < 0.1%
4 | 1 | < 0.1%

Maximum 5 values

Value | Count | Frequency (%)
2937 | 1 | < 0.1%
2936 | 1 | < 0.1%
2935 | 1 | < 0.1%
2934 | 1 | < 0.1%
2933 | 1 | < 0.1%
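
Several of the figures reported for each numeric variable are derived from others in the same block. The sketch below assumes the usual definitions (range = maximum - minimum, IQR = Q3 - Q1, CV = standard deviation / mean, variance = standard deviation squared) and applies them to the df_index values printed above. The helper name `derived_stats` is hypothetical and is not part of pandas-profiling.

```python
def derived_stats(minimum, maximum, q1, q3, mean, std):
    """Recompute the derived statistics from the primary ones.

    Assumes the conventional definitions; this is an illustration,
    not the profiling tool's own implementation.
    """
    return {
        "range": maximum - minimum,
        "iqr": q3 - q1,
        "cv": std / mean,
        "variance": std ** 2,
    }


# Primary statistics copied from the df_index block of the report.
df_index_stats = derived_stats(
    minimum=0,
    maximum=2937,
    q1=732.75,
    q3=2203.25,
    mean=1467.527322,
    std=848.8254023,
)

for name, value in df_index_stats.items():
    print(f"{name}: {value}")
```

Because the report rounds its inputs, the recomputed CV and variance agree with the printed values only up to rounding.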

Country
Categorical

HIGH CARDINALITY
UNIFORM
Distinct count: 183
Unique (%): 6.2%
Missing: 0
Missing (%): 0.0%
Memory size: 23.0 KiB

Common values

Value | Count | Frequency (%)
Dominican Republic | 16 | 0.5%
Croatia | 16 | 0.5%
United States of America | 16 | 0.5%
Cuba | 16 | 0.5%
Trinidad and Tobago | 16 | 0.5%
Poland | 16 | 0.5%
Côte d'Ivoire | 16 | 0.5%
Malta | 16 | 0.5%
Kiribati | 16 | 0.5%
Eritrea | 16 | 0.5%
Other values (173) | 2768 | 94.5%

Length

Max length: 52
Mean length: 10.04371585
Min length: 4

Unicode categories

Value | Count | Frequency (%)
Lowercase_Letter | 27 | 48.2%
Uppercase_Letter | 24 | 42.9%
Close_Punctuation | 1 | 1.8%
Open_Punctuation | 1 | 1.8%
Space_Separator | 1 | 1.8%
Other_Punctuation | 1 | 1.8%
Dash_Punctuation | 1 | 1.8%

Unicode scripts

Value | Count | Frequency (%)
Latin | 51 | 91.1%
Common | 5 | 8.9%

Unicode blocks

Value | Count | Frequency (%)
ASCII | 55 | 98.2%
Latin 1 Sup | 1 | 1.8%

Year
Real number (ℝ≥0)

Distinct count: 16
Unique (%): 0.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2007.5
Minimum: 2000
Maximum: 2015
Zeros: 0
Zeros (%): 0.0%
Memory size: 23.0 KiB

Quantile statistics

Minimum: 2000
5-th percentile: 2000
Q1: 2003.75
Median: 2007.5
Q3: 2011.25
95-th percentile: 2015
Maximum: 2015
Range: 15
Interquartile range (IQR): 7.5

Descriptive statistics

Standard deviation: 4.610559618
Coefficient of variation (CV): 0.002296667307
Kurtosis: -1.209427576
Mean: 2007.5
Median Absolute Deviation (MAD): 4
Skewness: 0
Sum: 5877960
Variance: 21.25725999

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
2015 | 183 | 6.2%
2013 | 183 | 6.2%
2011 | 183 | 6.2%
2009 | 183 | 6.2%
2007 | 183 | 6.2%
2005 | 183 | 6.2%
2003 | 183 | 6.2%
2001 | 183 | 6.2%
2014 | 183 | 6.2%
2012 | 183 | 6.2%
Other values (6) | 1098 | 37.5%

Minimum 5 values

Value | Count | Frequency (%)
2000 | 183 | 6.2%
2001 | 183 | 6.2%
2002 | 183 | 6.2%
2003 | 183 | 6.2%
2004 | 183 | 6.2%

Maximum 5 values

Value | Count | Frequency (%)
2015 | 183 | 6.2%
2014 | 183 | 6.2%
2013 | 183 | 6.2%
2012 | 183 | 6.2%
2011 | 183 | 6.2%

Status
Categorical

Distinct count: 2
Unique (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 23.0 KiB

Common values

Value | Count | Frequency (%)
Developing | 2416 | 82.5%
Developed | 512 | 17.5%

Length

Max length: 10
Mean length: 9.825136612
Min length: 9

Unicode categories

Value | Count | Frequency (%)
Lowercase_Letter | 9 | 90.0%
Uppercase_Letter | 1 | 10.0%

Unicode scripts

Value | Count | Frequency (%)
Latin | 10 | 100.0%

Unicode blocks

Value | Count | Frequency (%)
ASCII | 10 | 100.0%

Life expectancy
Real number (ℝ≥0)

Distinct count: 362
Unique (%): 12.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 69.22493169398908
Minimum: 36.3
Maximum: 89.0
Zeros: 0
Zeros (%): 0.0%
Memory size: 23.0 KiB

Quantile statistics

Minimum: 36.3
5-th percentile: 51.4
Q1: 63.1
Median: 72.1
Q3: 75.7
95-th percentile: 82
Maximum: 89
Range: 52.7
Interquartile range (IQR): 12.6

Descriptive statistics

Standard deviation: 9.523867488
Coefficient of variation (CV): 0.1375785754
Kurtosis: -0.2344773942
Mean: 69.22493169
Median Absolute Deviation (MAD): 5.8
Skewness: -0.6386047359
Sum: 202690.6
Variance: 90.70405193

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
73 | 45 | 1.5%
75 | 33 | 1.1%
78 | 31 | 1.1%
73.6 | 28 | 1.0%
76 | 25 | 0.9%
73.9 | 25 | 0.9%
81 | 25 | 0.9%
74.7 | 24 | 0.8%
74.5 | 24 | 0.8%
74.2 | 23 | 0.8%
Other values (352) | 2645 | 90.3%

Minimum 5 values

Value | Count | Frequency (%)
36.3 | 1 | < 0.1%
39 | 1 | < 0.1%
41 | 1 | < 0.1%
41.5 | 1 | < 0.1%
42.3 | 1 | < 0.1%

Maximum 5 values

Value | Count | Frequency (%)
89 | 11 | 0.4%
88 | 10 | 0.3%
87 | 9 | 0.3%
86 | 15 | 0.5%
85 | 12 | 0.4%

Adult Mortality
Real number (ℝ≥0)

Distinct count: 425
Unique (%): 14.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 164.79644808743168
Minimum: 1.0
Maximum: 723.0
Zeros: 0
Zeros (%): 0.0%
Memory size: 23.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 13
Q1: 74
Median: 144
Q3: 228
95-th percentile: 398.3
Maximum: 723
Range: 722
Interquartile range (IQR): 154

Descriptive statistics

Standard deviation: 124.292079
Coefficient of variation (CV): 0.754215764
Kurtosis: 1.748860208
Mean: 164.7964481
Median Absolute Deviation (MAD): 76
Skewness: 1.174369488
Sum: 482524
Variance: 15448.5209

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
12 | 34 | 1.2%
14 | 30 | 1.0%
16 | 29 | 1.0%
138 | 25 | 0.9%
11 | 25 | 0.9%
19 | 23 | 0.8%
144 | 22 | 0.8%
13 | 21 | 0.7%
15 | 21 | 0.7%
17 | 21 | 0.7%
Other values (415) | 2677 | 91.4%

Minimum 5 values

Value | Count | Frequency (%)
1 | 12 | 0.4%
2 | 8 | 0.3%
3 | 6 | 0.2%
4 | 4 | 0.1%
5 | 2 | 0.1%

Maximum 5 values

Value | Count | Frequency (%)
723 | 1 | < 0.1%
717 | 1 | < 0.1%
715 | 1 | < 0.1%
699 | 1 | < 0.1%
693 | 1 | < 0.1%

infant deaths
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS
Distinct count: 209
Unique (%): 7.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 30.407445355191257
Minimum: 0
Maximum: 1800
Zeros: 838
Zeros (%): 28.6%
Memory size: 23.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 3
Q3: 22
95-th percentile: 94.65
Maximum: 1800
Range: 1800
Interquartile range (IQR): 22

Descriptive statistics

Standard deviation: 118.1144496
Coefficient of variation (CV): 3.884392399
Kurtosis: 115.6574795
Mean: 30.40744536
Median Absolute Deviation (MAD): 3
Skewness: 9.771044493
Sum: 89033
Variance: 13951.0232

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
0 | 838 | 28.6%
1 | 342 | 11.7%
2 | 203 | 6.9%
3 | 175 | 6.0%
4 | 96 | 3.3%
8 | 57 | 1.9%
7 | 53 | 1.8%
10 | 48 | 1.6%
9 | 48 | 1.6%
6 | 46 | 1.6%
Other values (199) | 1022 | 34.9%

Minimum 5 values

Value | Count | Frequency (%)
0 | 838 | 28.6%
1 | 342 | 11.7%
2 | 203 | 6.9%
3 | 175 | 6.0%
4 | 96 | 3.3%

Maximum 5 values

Value | Count | Frequency (%)
1800 | 2 | 0.1%
1700 | 2 | 0.1%
1600 | 1 | < 0.1%
1500 | 2 | 0.1%
1400 | 1 | < 0.1%

Alcohol
Real number (ℝ≥0)

Distinct count: 1076
Unique (%): 36.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.559166666666667
Minimum: 0.01
Maximum: 17.87
Zeros: 0
Zeros (%): 0.0%
Memory size: 23.0 KiB

Quantile statistics

Minimum: 0.01
5-th percentile: 0.01
Q1: 1.1075
Median: 3.77
Q3: 7.4
95-th percentile: 11.89
Maximum: 17.87
Range: 17.86
Interquartile range (IQR): 6.2925

Descriptive statistics

Standard deviation: 3.920534029
Coefficient of variation (CV): 0.8599233842
Kurtosis: -0.6268158983
Mean: 4.559166667
Median Absolute Deviation (MAD): 3.1
Skewness: 0.6470018137
Sum: 13349.24
Variance: 15.37058707

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
0.01 | 280 | 9.6%
3.77 | 196 | 6.7%
0.03 | 15 | 0.5%
0.04 | 13 | 0.4%
0.02 | 12 | 0.4%
0.09 | 12 | 0.4%
1.18 | 10 | 0.3%
0.06 | 10 | 0.3%
0.21 | 10 | 0.3%
0.56 | 9 | 0.3%
Other values (1066) | 2361 | 80.6%

Minimum 5 values

Value | Count | Frequency (%)
0.01 | 280 | 9.6%
0.02 | 12 | 0.4%
0.03 | 15 | 0.5%
0.04 | 13 | 0.4%
0.05 | 9 | 0.3%

Maximum 5 values

Value | Count | Frequency (%)
17.87 | 1 | < 0.1%
17.31 | 1 | < 0.1%
16.99 | 1 | < 0.1%
16.58 | 1 | < 0.1%
16.35 | 1 | < 0.1%

percentage expenditure
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS
Distinct count: 2323
Unique (%): 79.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 740.3211850204337
Minimum: 0.0
Maximum: 19479.91161
Zeros: 606
Zeros (%): 20.7%
Memory size: 23.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 4.853963995
Median: 65.61145482
Q3: 442.6143215
95-th percentile: 4507.913607
Maximum: 19479.91161
Range: 19479.91161
Interquartile range (IQR): 437.7603575

Descriptive statistics

Standard deviation: 1990.930605
Coefficient of variation (CV): 2.689279525
Kurtosis: 26.47582908
Mean: 740.321185
Median Absolute Deviation (MAD): 65.61145482
Skewness: 4.643789672
Sum: 2167660.43
Variance: 3963804.673

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
0 | 606 | 20.7%
345.9044258 | 1 | < 0.1%
2698.01817 | 1 | < 0.1%
3.43334364 | 1 | < 0.1%
8.758214538 | 1 | < 0.1%
5.103249438 | 1 | < 0.1%
70.27113179 | 1 | < 0.1%
6164.455402 | 1 | < 0.1%
0.962497052 | 1 | < 0.1%
253.4022338 | 1 | < 0.1%
Other values (2313) | 2313 | 79.0%

Minimum 5 values

Value | Count | Frequency (%)
0 | 606 | 20.7%
0.09987219 | 1 | < 0.1%
0.108055973 | 1 | < 0.1%
0.27564826 | 1 | < 0.1%
0.328418056 | 1 | < 0.1%

Maximum 5 values

Value | Count | Frequency (%)
19479.91161 | 1 | < 0.1%
19099.04506 | 1 | < 0.1%
18961.3486 | 1 | < 0.1%
18822.86732 | 1 | < 0.1%
18379.32974 | 1 | < 0.1%

Hepatitis B
Real number (ℝ≥0)

Distinct count: 87
Unique (%): 3.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 83.0457650273224
Minimum: 1.0
Maximum: 99.0
Zeros: 0
Zeros (%): 0.0%
Memory size: 23.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 9
Q1: 82
Median: 92
Q3: 96
95-th percentile: 99
Maximum: 99
Range: 98
Interquartile range (IQR): 14

Descriptive statistics

Standard deviation: 22.94204659
Coefficient of variation (CV): 0.2762578752
Kurtosis: 4.43071141
Mean: 83.04576503
Median Absolute Deviation (MAD): 5
Skewness: -2.286144918
Sum: 243158
Variance: 526.3375017

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
92 | 645 | 22.0%
99 | 237 | 8.1%
98 | 209 | 7.1%
96 | 166 | 5.7%
97 | 154 | 5.3%
95 | 149 | 5.1%
94 | 127 | 4.3%
93 | 101 | 3.4%
91 | 75 | 2.6%
89 | 71 | 2.4%
Other values (77) | 994 | 33.9%

Minimum 5 values

Value | Count | Frequency (%)
1 | 1 | < 0.1%
2 | 4 | 0.1%
4 | 4 | 0.1%
5 | 9 | 0.3%
6 | 17 | 0.6%

Maximum 5 values

Value | Count | Frequency (%)
99 | 237 | 8.1%
98 | 209 | 7.1%
97 | 154 | 5.3%
96 | 166 | 5.7%
95 | 149 | 5.1%

Measles
Real number (ℝ≥0)

ZEROS
Distinct count: 958
Unique (%): 32.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2427.85587431694
Minimum: 0
Maximum: 212183
Zeros: 973
Zeros (%): 33.2%
Memory size: 23.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 17
Q3: 362.25
95-th percentile: 9994.05
Maximum: 212183
Range: 212183
Interquartile range (IQR): 362.25

Descriptive statistics

Standard deviation: 11485.97094
Coefficient of variation (CV): 4.73091136
Kurtosis: 114.4679785
Mean: 2427.855874
Median Absolute Deviation (MAD): 17
Skewness: 9.425290043
Sum: 7108762
Variance: 131927528.4

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
0 | 973 | 33.2%
1 | 104 | 3.6%
2 | 68 | 2.3%
3 | 44 | 1.5%
4 | 33 | 1.1%
6 | 29 | 1.0%
7 | 28 | 1.0%
5 | 25 | 0.9%
8 | 24 | 0.8%
9 | 22 | 0.8%
Other values (948) | 1578 | 53.9%

Minimum 5 values

Value | Count | Frequency (%)
0 | 973 | 33.2%
1 | 104 | 3.6%
2 | 68 | 2.3%
3 | 44 | 1.5%
4 | 33 | 1.1%

Maximum 5 values

Value | Count | Frequency (%)
212183 | 1 | < 0.1%
182485 | 1 | < 0.1%
168107 | 1 | < 0.1%
141258 | 1 | < 0.1%
133802 | 1 | < 0.1%

BMI
Real number (ℝ≥0)

Distinct count: 603
Unique (%): 20.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 38.29129098360656
Minimum: 1.0
Maximum: 77.6
Zeros: 0
Zeros (%): 0.0%
Memory size: 23.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 5.2
Q1: 19.4
Median: 43.35
Q3: 56.1
95-th percentile: 64.5
Maximum: 77.6
Range: 76.6
Interquartile range (IQR): 36.7

Descriptive statistics

Standard deviation: 19.85730792
Coefficient of variation (CV): 0.5185854906
Kurtosis: -1.294107361
Mean: 38.29129098
Median Absolute Deviation (MAD): 16.15
Skewness: -0.239841994
Sum: 112116.9
Variance: 394.3126778

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
43.35 | 32 | 1.1%
58.5 | 18 | 0.6%
55.8 | 16 | 0.5%
57 | 16 | 0.5%
59.9 | 15 | 0.5%
54.2 | 15 | 0.5%
59.3 | 14 | 0.5%
55 | 13 | 0.4%
52.8 | 13 | 0.4%
59.4 | 13 | 0.4%
Other values (593) | 2763 | 94.4%

Minimum 5 values

Value | Count | Frequency (%)
1 | 1 | < 0.1%
1.4 | 2 | 0.1%
1.8 | 1 | < 0.1%
1.9 | 1 | < 0.1%
2 | 1 | < 0.1%

Maximum 5 values

Value | Count | Frequency (%)
77.6 | 1 | < 0.1%
77.1 | 1 | < 0.1%
76.7 | 1 | < 0.1%
76.2 | 1 | < 0.1%
75.7 | 1 | < 0.1%

under-five deaths
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS
Distinct count: 252
Unique (%): 8.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 42.17930327868852
Minimum: 0
Maximum: 2500
Zeros: 775
Zeros (%): 26.5%
Memory size: 23.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 4
Q3: 28
95-th percentile: 138
Maximum: 2500
Range: 2500
Interquartile range (IQR): 28

Descriptive statistics

Standard deviation: 160.7005471
Coefficient of variation (CV): 3.809938395
Kurtosis: 109.3884348
Mean: 42.17930328
Median Absolute Deviation (MAD): 4
Skewness: 9.479622923
Sum: 123501
Variance: 25824.66582

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
0 | 775 | 26.5%
1 | 361 | 12.3%
2 | 163 | 5.6%
4 | 161 | 5.5%
3 | 129 | 4.4%
12 | 53 | 1.8%
8 | 49 | 1.7%
6 | 48 | 1.6%
10 | 47 | 1.6%
5 | 44 | 1.5%
Other values (242) | 1098 | 37.5%

Minimum 5 values

Value | Count | Frequency (%)
0 | 775 | 26.5%
1 | 361 | 12.3%
2 | 163 | 5.6%
3 | 129 | 4.4%
4 | 161 | 5.5%

Maximum 5 values

Value | Count | Frequency (%)
2500 | 1 | < 0.1%
2400 | 1 | < 0.1%
2300 | 1 | < 0.1%
2200 | 1 | < 0.1%
2100 | 1 | < 0.1%

Polio
Real number (ℝ≥0)

Distinct count: 73
Unique (%): 2.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 82.61612021857924
Minimum: 3.0
Maximum: 99.0
Zeros: 0
Zeros (%): 0.0%
Memory size: 23.0 KiB

Quantile statistics

Minimum: 3
5-th percentile: 9
Q1: 78
Median: 93
Q3: 97
95-th percentile: 99
Maximum: 99
Range: 96
Interquartile range (IQR): 19

Descriptive statistics

Standard deviation: 23.35563438
Coefficient of variation (CV): 0.2827006923
Kurtosis: 3.829795249
Mean: 82.61612022
Median Absolute Deviation (MAD): 6
Skewness: -2.108850504
Sum: 241900
Variance: 545.4856574

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
99 | 373 | 12.7%
98 | 254 | 8.7%
97 | 205 | 7.0%
96 | 205 | 7.0%
95 | 180 | 6.1%
94 | 159 | 5.4%
93 | 139 | 4.7%
92 | 96 | 3.3%
91 | 88 | 3.0%
88 | 70 | 2.4%
Other values (63) | 1159 | 39.6%

Minimum 5 values

Value | Count | Frequency (%)
3 | 7 | 0.2%
4 | 11 | 0.4%
5 | 8 | 0.3%
6 | 11 | 0.4%
7 | 24 | 0.8%

Maximum 5 values

Value | Count | Frequency (%)
99 | 373 | 12.7%
98 | 254 | 8.7%
97 | 205 | 7.0%
96 | 205 | 7.0%
95 | 180 | 6.1%

Total expenditure
Real number (ℝ≥0)

Distinct count: 816
Unique (%): 27.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 5.916256830601094
Minimum: 0.37
Maximum: 17.6
Zeros: 0
Zeros (%): 0.0%
Memory size: 23.0 KiB

Quantile statistics

Minimum: 0.37
5-th percentile: 1.9835
Q1: 4.37
Median: 5.75
Q3: 7.33
95-th percentile: 9.6865
Maximum: 17.6
Range: 17.23
Interquartile range (IQR): 2.96

Descriptive statistics

Standard deviation: 2.385962519
Coefficient of variation (CV): 0.4032892059
Kurtosis: 1.338509037
Mean: 5.916256831
Median Absolute Deviation (MAD): 1.45
Skewness: 0.6179607831
Sum: 17322.8
Variance: 5.692817142

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
5.75 | 229 | 7.8%
4.6 | 15 | 0.5%
6.7 | 12 | 0.4%
5.6 | 11 | 0.4%
5.64 | 10 | 0.3%
5.3 | 10 | 0.3%
3.4 | 10 | 0.3%
5.9 | 10 | 0.3%
9.1 | 10 | 0.3%
5.25 | 10 | 0.3%
Other values (806) | 2601 | 88.8%

Minimum 5 values

Value | Count | Frequency (%)
0.37 | 1 | < 0.1%
0.65 | 1 | < 0.1%
0.74 | 1 | < 0.1%
0.76 | 1 | < 0.1%
0.92 | 1 | < 0.1%

Maximum 5 values

Value | Count | Frequency (%)
17.6 | 1 | < 0.1%
17.2 | 2 | 0.1%
17.14 | 1 | < 0.1%
17 | 1 | < 0.1%
16.9 | 1 | < 0.1%

Diphtheria
Real number (ℝ≥0)

Distinct count: 81
Unique (%): 2.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 82.39071038251366
Minimum: 2.0
Maximum: 99.0
Zeros: 0
Zeros (%): 0.0%
Memory size: 23.0 KiB

Quantile statistics

Minimum: 2
5-th percentile: 9
Q1: 78
Median: 93
Q3: 97
95-th percentile: 99
Maximum: 99
Range: 97
Interquartile range (IQR): 19

Descriptive statistics

Standard deviation: 23.64513172
Coefficient of variation (CV): 0.2869878365
Kurtosis: 3.609373309
Mean: 82.39071038
Median Absolute Deviation (MAD): 5
Skewness: -2.083449761
Sum: 241240
Variance: 559.092254

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
99 | 347 | 11.9%
98 | 253 | 8.6%
97 | 205 | 7.0%
95 | 200 | 6.8%
96 | 199 | 6.8%
94 | 149 | 5.1%
93 | 139 | 4.7%
92 | 100 | 3.4%
91 | 91 | 3.1%
89 | 76 | 2.6%
Other values (71) | 1169 | 39.9%

Minimum 5 values

Value | Count | Frequency (%)
2 | 1 | < 0.1%
3 | 4 | 0.1%
4 | 12 | 0.4%
5 | 10 | 0.3%
6 | 16 | 0.5%

Maximum 5 values

Value | Count | Frequency (%)
99 | 347 | 11.9%
98 | 253 | 8.6%
97 | 205 | 7.0%
96 | 199 | 6.8%
95 | 200 | 6.8%

HIV/AIDS
Real number (ℝ≥0)

Distinct count: 200
Unique (%): 6.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.7477117486338798
Minimum: 0.1
Maximum: 50.6
Zeros: 0
Zeros (%): 0.0%
Memory size: 23.0 KiB

Quantile statistics

Minimum: 0.1
5-th percentile: 0.1
Q1: 0.1
Median: 0.1
Q3: 0.8
95-th percentile: 8.565
Maximum: 50.6
Range: 50.5
Interquartile range (IQR): 0.7

Descriptive statistics

Standard deviation: 5.08554241
Coefficient of variation (CV): 2.909829046
Kurtosis: 34.76639831
Mean: 1.747711749
Median Absolute Deviation (MAD): 0
Skewness: 5.386623166
Sum: 5117.3
Variance: 25.8627416

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
0.1 | 1771 | 60.5%
0.2 | 124 | 4.2%
0.3 | 115 | 3.9%
0.4 | 69 | 2.4%
0.5 | 42 | 1.4%
0.6 | 35 | 1.2%
0.8 | 32 | 1.1%
0.9 | 32 | 1.1%
0.7 | 29 | 1.0%
1.5 | 21 | 0.7%
Other values (190) | 658 | 22.5%

Minimum 5 values

Value | Count | Frequency (%)
0.1 | 1771 | 60.5%
0.2 | 124 | 4.2%
0.3 | 115 | 3.9%
0.4 | 69 | 2.4%
0.5 | 42 | 1.4%

Maximum 5 values

Value | Count | Frequency (%)
50.6 | 1 | < 0.1%
50.3 | 1 | < 0.1%
49.9 | 1 | < 0.1%
49.1 | 1 | < 0.1%
48.8 | 1 | < 0.1%

GDP
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 2485
Unique (%): 84.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6627.389706998224
Minimum: 1.68135
Maximum: 119172.7418
Zeros: 0
Zeros (%): 0.0%
Memory size: 23.0 KiB

Quantile statistics

Minimum: 1.68135
5-th percentile: 81.7460741
Q1: 578.7970947
Median: 1764.97387
Q3: 4793.630903
95-th percentile: 37695.26354
Maximum: 119172.7418
Range: 119171.0605
Interquartile range (IQR): 4214.833808

Descriptive statistics

Standard deviation: 13316.39253
Coefficient of variation (CV): 2.00929674
Kurtosis: 15.08035247
Mean: 6627.389707
Median Absolute Deviation (MAD): 1427.131622
Skewness: 3.536630377
Sum: 19404997.06
Variance: 177326310.1

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
1764.97387 | 444 | 15.2%
965.6693779 | 1 | < 0.1%
1276.265 | 1 | < 0.1%
3638.95946 | 1 | < 0.1%
2158.299 | 1 | < 0.1%
1768.92132 | 1 | < 0.1%
261.456882 | 1 | < 0.1%
558.221144 | 1 | < 0.1%
38532.488 | 1 | < 0.1%
5.6687264 | 1 | < 0.1%
Other values (2475) | 2475 | 84.5%

Minimum 5 values

Value | Count | Frequency (%)
1.68135 | 1 | < 0.1%
3.685949 | 1 | < 0.1%
4.6135745 | 1 | < 0.1%
5.6687264 | 1 | < 0.1%
8.376432 | 1 | < 0.1%

Maximum 5 values

Value | Count | Frequency (%)
119172.7418 | 1 | < 0.1%
115761.577 | 1 | < 0.1%
114293.8433 | 1 | < 0.1%
113751.85 | 1 | < 0.1%
89739.7117 | 1 | < 0.1%

Population
Real number (ℝ≥0)

Distinct count: 2278
Unique (%): 77.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 10263150.47795082
Minimum: 34.0
Maximum: 1293859294.0
Zeros: 0
Zeros (%): 0.0%
Memory size: 23.0 KiB

Quantile statistics

Minimum: 34
5-th percentile: 14936.05
Q1: 418120.5
Median: 1391756.5
Q3: 4592776.75
95-th percentile: 41358873.85
Maximum: 1293859294
Range: 1293859260
Interquartile range (IQR): 4174656.25

Descriptive statistics

Standard deviation: 54111788.44
Coefficient of variation (CV): 5.272434479
Kurtosis: 379.866251
Mean: 10263150.48
Median Absolute Deviation (MAD): 1224063.5
Skewness: 17.9433003
Sum: 3.00505046e+10
Variance: 2.928085649e+15

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
1391756.5 | 644 | 22.0%
444 | 4 | 0.1%
1141 | 2 | 0.1%
26868 | 2 | 0.1%
718239 | 2 | 0.1%
127445 | 2 | 0.1%
851967 | 1 | < 0.1%
13281 | 1 | < 0.1%
216375 | 1 | < 0.1%
322817 | 1 | < 0.1%
Other values (2268) | 2268 | 77.5%

Minimum 5 values

Value | Count | Frequency (%)
34 | 1 | < 0.1%
36 | 1 | < 0.1%
41 | 1 | < 0.1%
43 | 1 | < 0.1%
123 | 1 | < 0.1%

Maximum 5 values

Value | Count | Frequency (%)
1293859294 | 1 | < 0.1%
1179681239 | 1 | < 0.1%
1161977719 | 1 | < 0.1%
1144118674 | 1 | < 0.1%
1126135777 | 1 | < 0.1%

thinness 10-19 years
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 200
Unique (%): 6.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.833674863387978
Minimum: 0.1
Maximum: 27.7
Zeros: 0
Zeros (%): 0.0%
Memory size: 23.0 KiB

Quantile statistics

Minimum: 0.1
5-th percentile: 0.6
Q1: 1.6
Median: 3.3
Q3: 7.1
95-th percentile: 13.8
Maximum: 27.7
Range: 27.6
Interquartile range (IQR): 5.5

Descriptive statistics

Standard deviation: 4.399552717
Coefficient of variation (CV): 0.9101879712
Kurtosis: 4.050770945
Mean: 4.833674863
Median Absolute Deviation (MAD): 2.3
Skewness: 1.727661542
Sum: 14153
Variance: 19.35606411

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
1 | 74 | 2.5%
3.3 | 70 | 2.4%
1.9 | 65 | 2.2%
0.8 | 64 | 2.2%
0.7 | 63 | 2.2%
1.2 | 62 | 2.1%
2.1 | 61 | 2.1%
1.5 | 60 | 2.0%
2.2 | 58 | 2.0%
2 | 57 | 1.9%
Other values (190) | 2294 | 78.3%

Minimum 5 values

Value | Count | Frequency (%)
0.1 | 23 | 0.8%
0.2 | 39 | 1.3%
0.3 | 32 | 1.1%
0.4 | 5 | 0.2%
0.5 | 35 | 1.2%

Maximum 5 values

Value | Count | Frequency (%)
27.7 | 1 | < 0.1%
27.5 | 1 | < 0.1%
27.4 | 1 | < 0.1%
27.3 | 1 | < 0.1%
27.2 | 2 | 0.1%

thinness 5-9 years
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 207
Unique (%): 7.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4.865232240437158
Minimum: 0.1
Maximum: 28.6
Zeros: 0
Zeros (%): 0.0%
Memory size: 23.0 KiB

Quantile statistics

Minimum: 0.1
5-th percentile: 0.5
Q1: 1.6
Median: 3.4
Q3: 7.2
95-th percentile: 13.8
Maximum: 28.6
Range: 28.5
Interquartile range (IQR): 5.6

Descriptive statistics

Standard deviation: 4.487535051
Coefficient of variation (CV): 0.9223681069
Kurtosis: 4.443803209
Mean: 4.86523224
Median Absolute Deviation (MAD): 2.4
Skewness: 1.793649837
Sum: 14245.4
Variance: 20.13797084

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
3.4 | 79 | 2.7%
0.9 | 69 | 2.4%
1.1 | 67 | 2.3%
0.5 | 63 | 2.2%
1.9 | 63 | 2.2%
1 | 62 | 2.1%
2.1 | 61 | 2.1%
1.3 | 59 | 2.0%
1.5 | 57 | 1.9%
1.7 | 55 | 1.9%
Other values (197) | 2293 | 78.3%

Minimum 5 values

Value | Count | Frequency (%)
0.1 | 31 | 1.1%
0.2 | 45 | 1.5%
0.3 | 25 | 0.9%
0.4 | 17 | 0.6%
0.5 | 63 | 2.2%

Maximum 5 values

Value | Count | Frequency (%)
28.6 | 1 | < 0.1%
28.5 | 1 | < 0.1%
28.4 | 1 | < 0.1%
28.3 | 1 | < 0.1%
28.2 | 1 | < 0.1%

Income composition of resources
Real number (ℝ≥0)

ZEROS
Distinct count: 625
Unique (%): 21.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.6301280737704917
Minimum: 0.0
Maximum: 0.948
Zeros: 130
Zeros (%): 4.4%
Memory size: 23.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0.291
Q1: 0.504
Median: 0.677
Q3: 0.773
95-th percentile: 0.89
Maximum: 0.948
Range: 0.948
Interquartile range (IQR): 0.269

Descriptive statistics

Standard deviation: 0.2054400437
Coefficient of variation (CV): 0.3260290284
Kurtosis: 1.67696235
Mean: 0.6301280738
Median Absolute Deviation (MAD): 0.118
Skewness: -1.208178493
Sum: 1845.015
Variance: 0.04220561154

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
0.677 | 169 | 5.8%
0 | 130 | 4.4%
0.7 | 17 | 0.6%
0.739 | 13 | 0.4%
0.636 | 12 | 0.4%
0.714 | 12 | 0.4%
0.86 | 11 | 0.4%
0.703 | 11 | 0.4%
0.723 | 11 | 0.4%
0.734 | 11 | 0.4%
Other values (615) | 2531 | 86.4%

Minimum 5 values

Value | Count | Frequency (%)
0 | 130 | 4.4%
0.253 | 1 | < 0.1%
0.255 | 1 | < 0.1%
0.261 | 1 | < 0.1%
0.266 | 1 | < 0.1%

Maximum 5 values

Value | Count | Frequency (%)
0.948 | 1 | < 0.1%
0.945 | 1 | < 0.1%
0.942 | 1 | < 0.1%
0.941 | 1 | < 0.1%
0.939 | 1 | < 0.1%

Schooling
Real number (ℝ≥0)

Distinct count: 173
Unique (%): 5.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 12.016051912568306
Minimum: 0.0
Maximum: 20.7
Zeros: 26
Zeros (%): 0.9%
Memory size: 23.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 6
Q1: 10.3
Median: 12.3
Q3: 14.1
95-th percentile: 16.8
Maximum: 20.7
Range: 20.7
Interquartile range (IQR): 3.8

Descriptive statistics

Standard deviation: 3.254407306
Coefficient of variation (CV): 0.2708383194
Kurtosis: 1.069917167
Mean: 12.01605191
Median Absolute Deviation (MAD): 1.9
Skewness: -0.6152058031
Sum: 35183
Variance: 10.59116691

Histogram with fixed size bins (bins=10)

Common values

Value | Count | Frequency (%)
12.3 | 204 | 7.0%
12.9 | 58 | 2.0%
13.3 | 52 | 1.8%
12.5 | 49 | 1.7%
12.8 | 46 | 1.6%
12.6 | 43 | 1.5%
12.4 | 42 | 1.4%
10.7 | 41 | 1.4%
11.9 | 41 | 1.4%
11.7 | 40 | 1.4%
Other values (163) | 2312 | 79.0%

Minimum 5 values

Value | Count | Frequency (%)
0 | 26 | 0.9%
2.8 | 1 | < 0.1%
2.9 | 4 | 0.1%
3 | 1 | < 0.1%
3.1 | 1 | < 0.1%

Maximum 5 values

Value | Count | Frequency (%)
20.7 | 1 | < 0.1%
20.6 | 1 | < 0.1%
20.5 | 1 | < 0.1%
20.4 | 3 | 0.1%
20.3 | 4 | 0.1%

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and (positive) scale of the two variables, which implies that for an exactly linear relationship the steepness of the line does not affect r; only the sign of the slope does.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
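
The calculation described above can be sketched in plain Python. This is a minimal illustration using only the standard library, not the routine the report itself uses; the function name `pearson_r` is an assumption made for this example.

```python
import math


def pearson_r(xs, ys):
    """Pearson's r: covariance of X and Y over the product of their
    standard deviations. The 1/n factors cancel, so sums are used."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equally long sequences with at least 2 points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    if spread_x == 0 or spread_y == 0:
        raise ValueError("r is undefined when a variable is constant")
    return cov / (spread_x * spread_y)


# Shifting and positively rescaling one variable leaves r unchanged.
xs = [1.0, 2.0, 4.0, 7.0]
ys = [3.0, 1.0, 6.0, 8.0]
print(pearson_r(xs, ys))
print(pearson_r(xs, [10 + 3 * y for y in ys]))
```

The two printed values agree up to floating-point rounding, which illustrates the location and scale invariance mentioned above.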

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of the standard deviations of those rank variables.
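
As a rough sketch of that recipe, the snippet below ranks each variable (tied values share the average of their ranks, which is a common convention and an assumption here) and then applies the Pearson formula to the ranks. The helper names `average_ranks` and `spearman_rho` are hypothetical.

```python
import math


def average_ranks(values):
    """Return 1-based ranks; tied values receive the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def pearson_r(xs, ys):
    """Covariance over the product of standard deviations (1/n cancels)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


def spearman_rho(xs, ys):
    """Spearman's rho as Pearson's r computed on the rank variables."""
    return pearson_r(average_ranks(xs), average_ranks(ys))


# A cubic relationship is monotonic but not linear.
xs = [1, 2, 3, 4, 5]
ys = [x ** 3 for x in xs]
print(spearman_rho(xs, ys), pearson_r(xs, ys))
```

For the cubic example, ρ comes out at 1 up to rounding while r stays below 1, which is the sense in which ρ captures nonlinear monotonic relationships.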

Kendall's τ

Like Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
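
The pair-counting definition above can be written out directly. The sketch below implements exactly that formula (often called tau-a), in which tied pairs count as neither concordant nor discordant; statistical libraries commonly apply a tie correction instead, so their results may differ when ties are present. The name `kendall_tau_a` is an assumption for this example.

```python
from itertools import combinations


def kendall_tau_a(xs, ys):
    """(concordant - discordant) / total number of pairs.

    Runs in O(n^2) time, which is fine for illustration but slow
    for large inputs.
    """
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equally long sequences with at least 2 points")
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1  # both variables move the same way
        elif direction < 0:
            discordant += 1  # the variables move in opposite ways
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


# Two of the three pairs are concordant and one is discordant.
print(kendall_tau_a([1, 2, 3], [1, 3, 2]))
```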

Phik (φk)

Phik (φk) is a correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation on φk is available.

Missing values

Sample

First rows

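(index) | df_index | Country | Year | Status | Life expectancy | Adult Mortality | infant deaths | Alcohol | percentage expenditure | Hepatitis B | Measles | BMI | under-five deaths | Polio | Total expenditure | Diphtheria | HIV/AIDS | GDP | Population | thinness 10-19 years | thinness 5-9 years | Income composition of resources | Schooling
0 | 0 | Afghanistan | 2015 | Developing | 65.0 | 263.0 | 62 | 0.01 | 71.279624 | 65.0 | 1154 | 19.1 | 83 | 6.0 | 8.16 | 65.0 | 0.1 | 584.259210 | 33736494.0 | 17.2 | 17.3 | 0.479 | 10.1
1 | 1 | Afghanistan | 2014 | Developing | 59.9 | 271.0 | 64 | 0.01 | 73.523582 | 62.0 | 492 | 18.6 | 86 | 58.0 | 8.18 | 62.0 | 0.1 | 612.696514 | 327582.0 | 17.5 | 17.5 | 0.476 | 10.0
2 | 2 | Afghanistan | 2013 | Developing | 59.9 | 268.0 | 66 | 0.01 | 73.219243 | 64.0 | 430 | 18.1 | 89 | 62.0 | 8.13 | 64.0 | 0.1 | 631.744976 | 31731688.0 | 17.7 | 17.7 | 0.470 | 9.9
3 | 3 | Afghanistan | 2012 | Developing | 59.5 | 272.0 | 69 | 0.01 | 78.184215 | 67.0 | 2787 | 17.6 | 93 | 67.0 | 8.52 | 67.0 | 0.1 | 669.959000 | 3696958.0 | 17.9 | 18.0 | 0.463 | 9.8
4 | 4 | Afghanistan | 2011 | Developing | 59.2 | 275.0 | 71 | 0.01 | 7.097109 | 68.0 | 3013 | 17.2 | 97 | 68.0 | 7.87 | 68.0 | 0.1 | 63.537231 | 2978599.0 | 18.2 | 18.2 | 0.454 | 9.5
5 | 5 | Afghanistan | 2010 | Developing | 58.8 | 279.0 | 74 | 0.01 | 79.679367 | 66.0 | 1989 | 16.7 | 102 | 66.0 | 9.20 | 66.0 | 0.1 | 553.328940 | 2883167.0 | 18.4 | 18.4 | 0.448 | 9.2
6 | 6 | Afghanistan | 2009 | Developing | 58.6 | 281.0 | 77 | 0.01 | 56.762217 | 63.0 | 2861 | 16.2 | 106 | 63.0 | 9.42 | 63.0 | 0.1 | 445.893298 | 284331.0 | 18.6 | 18.7 | 0.434 | 8.9
7 | 7 | Afghanistan | 2008 | Developing | 58.1 | 287.0 | 80 | 0.03 | 25.873925 | 64.0 | 1599 | 15.7 | 110 | 64.0 | 8.33 | 64.0 | 0.1 | 373.361116 | 2729431.0 | 18.8 | 18.9 | 0.433 | 8.7
8 | 8 | Afghanistan | 2007 | Developing | 57.5 | 295.0 | 82 | 0.02 | 10.910156 | 63.0 | 1141 | 15.2 | 113 | 63.0 | 6.73 | 63.0 | 0.1 | 369.835796 | 26616792.0 | 19.0 | 19.1 | 0.415 | 8.4
9 | 9 | Afghanistan | 2006 | Developing | 57.3 | 295.0 | 84 | 0.03 | 17.171518 | 64.0 | 1990 | 14.7 | 116 | 58.0 | 7.43 | 58.0 | 0.1 | 272.563770 | 2589345.0 | 19.2 | 19.3 | 0.405 | 8.1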

Last rows

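(index) | df_index | Country | Year | Status | Life expectancy | Adult Mortality | infant deaths | Alcohol | percentage expenditure | Hepatitis B | Measles | BMI | under-five deaths | Polio | Total expenditure | Diphtheria | HIV/AIDS | GDP | Population | thinness 10-19 years | thinness 5-9 years | Income composition of resources | Schooling
2918 | 2928 | Zimbabwe | 2009 | Developing | 50.0 | 587.0 | 30 | 4.64 | 1.040021 | 73.0 | 853 | 29.0 | 45 | 69.0 | 6.26 | 73.0 | 18.1 | 65.824121 | 1381599.0 | 7.5 | 7.4 | 0.419 | 9.9
2919 | 2929 | Zimbabwe | 2008 | Developing | 48.2 | 632.0 | 30 | 3.56 | 20.843429 | 75.0 | 0 | 28.6 | 46 | 75.0 | 4.96 | 75.0 | 20.5 | 325.678573 | 13558469.0 | 7.8 | 7.8 | 0.421 | 9.7
2920 | 2930 | Zimbabwe | 2007 | Developing | 46.6 | 67.0 | 29 | 3.88 | 29.814566 | 72.0 | 242 | 28.2 | 46 | 73.0 | 4.47 | 73.0 | 23.7 | 396.998217 | 1332999.0 | 8.2 | 8.2 | 0.414 | 9.6
2921 | 2931 | Zimbabwe | 2006 | Developing | 45.4 | 7.0 | 28 | 4.57 | 34.262169 | 68.0 | 212 | 27.9 | 45 | 71.0 | 5.12 | 7.0 | 26.8 | 414.796232 | 13124267.0 | 8.6 | 8.6 | 0.408 | 9.5
2922 | 2932 | Zimbabwe | 2005 | Developing | 44.6 | 717.0 | 28 | 4.14 | 8.717409 | 65.0 | 420 | 27.5 | 43 | 69.0 | 6.44 | 68.0 | 30.3 | 444.765750 | 129432.0 | 9.0 | 9.0 | 0.406 | 9.3
2923 | 2933 | Zimbabwe | 2004 | Developing | 44.3 | 723.0 | 27 | 4.36 | 0.000000 | 68.0 | 31 | 27.1 | 42 | 67.0 | 7.13 | 65.0 | 33.6 | 454.366654 | 12777511.0 | 9.4 | 9.4 | 0.407 | 9.2
2924 | 2934 | Zimbabwe | 2003 | Developing | 44.5 | 715.0 | 26 | 4.06 | 0.000000 | 7.0 | 998 | 26.7 | 41 | 7.0 | 6.52 | 68.0 | 36.7 | 453.351155 | 12633897.0 | 9.8 | 9.9 | 0.418 | 9.5
2925 | 2935 | Zimbabwe | 2002 | Developing | 44.8 | 73.0 | 25 | 4.43 | 0.000000 | 73.0 | 304 | 26.3 | 40 | 73.0 | 6.53 | 71.0 | 39.8 | 57.348340 | 125525.0 | 1.2 | 1.3 | 0.427 | 10.0
2926 | 2936 | Zimbabwe | 2001 | Developing | 45.3 | 686.0 | 25 | 1.72 | 0.000000 | 76.0 | 529 | 25.9 | 39 | 76.0 | 6.16 | 75.0 | 42.1 | 548.587312 | 12366165.0 | 1.6 | 1.7 | 0.427 | 9.8
2927 | 2937 | Zimbabwe | 2000 | Developing | 46.0 | 665.0 | 24 | 1.68 | 0.000000 | 79.0 | 1483 | 25.5 | 39 | 78.0 | 7.10 | 78.0 | 43.5 | 547.358879 | 12222251.0 | 11.0 | 11.2 | 0.434 | 9.8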